Prioritized Multi-View Stereo Depth Map Generation Using Confidence Prediction

نویسندگان

  • Christian Mostegel
  • Friedrich Fraundorfer
  • Horst Bischof
چکیده

In this work, we propose a novel approach to prioritize the depth map computation of multi-view stereo (MVS) to obtain compact 3D point clouds of high quality and completeness at low computational cost. Our prioritization approach operates before the MVS algorithm is executed and consists of two steps. In the first step, we aim to find a good set of matching partners for each view. In the second step, we rank the resulting view clusters (i.e. key views with matching partners) according to their impact on the fulfillment of desired quality parameters such as completeness, ground resolution and accuracy. Additional to geometric analysis, we use a novel machine learning technique for training a confidence predictor. The purpose of this confidence predictor is to estimate the chances of a successful depth reconstruction for each pixel in each image for one specific MVS algorithm based on the RGB images and the image constellation. The underlying machine learning technique does not require any ground truth or manually labeled data for training, but instead adapts ideas from depth map fusion for providing a supervision signal. The trained confidence predictor allows us to evaluate the quality of image constellations and their potential impact to the resulting 3D reconstruction and thus builds a solid foundation for our prioritization approach. In our experiments, we are thus able to reach more than 70% of the maximal reachable quality fulfillment using only 5% of the available images as key views. For evaluating our approach within and across different domains, we use two completely different scenarios, i.e. cultural heritage preservation and reconstruction of single family houses.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Multi-view Generation from Stereoscopic Images using the Depth Map

In this work, we generate multi-view images from stereoscopic texture images. After we estimate a disparity map using a stereo matching method, we detect an occlusion area in the acquired disparity map and refine the initial disparity values of the occlusion area. The refined disparity map is enhanced by a joint bilateral filter for boundary matching to improve the quality of synthesized multi-...

متن کامل

Virtual View Synthesis Using Moving Multi-Camera Array

Considerable improvements of information technologies (IT) allow consumers to enjoy various forms of multimedia services. Recently, 3D video has been established as a new format that provides more realistic and natural experiences to users by free-viewpoint TV (FTV). One of the main challenges is about rendering continuous viewpoint images using color and depth information. Thus, image interpol...

متن کامل

Generation of Multiple Depth Images from a Single Depth Map Using Multi-baseline Information

In this paper, we propose an algorithm for generation of multiple depth images from one depth map using multi-baseline information. Although depth information is essential in three-dimensional (3-D) applications, the depth estimation from stereo matching is still time consuming and inaccurate. Since the real-time application like free viewpoint TV (FTV) require multiple depth information, stere...

متن کامل

Survey on Benchmarks for a GPU Based Multi Camera Stereo Matching Algorithm

Stereo matching algorithms and multi camera reconstruction algorithms are usually compared using benchmarks. These benchmarks compare the quality of the resulting depth map or reconstructed surface mesh. We describe the differences between several known stereo and multi-view stereo benchmarks and their various datasets. Also the modifications that are necessary to use our own GPU based multi ca...

متن کامل

Three-dimensional Video Generation for Realistic Broadcasting Services

In this paper, we propose a new scheme to generate multi-view video-plus-depth using a hybrid camera system, which is composed of one depth camera and multiple video cameras. In order to create the threedimensional (3-D) video, we first calculate the initial disparity for each view by projecting depth camera data onto each video camera using 3-D image warping. Then, a stereo matching algorithm ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018